Efficient Bag of Scenes Analysis for Image Categorization
Authors
Abstract
In this paper, we address the general problem of image/object categorization with a novel approach referred to as Bag-of-Scenes (BoS). Our approach is efficient for low-level semantic applications such as texture classification as well as for higher-level semantic tasks such as natural scene recognition or fine-grained visual categorization (FGVC). It is based on the widely used combination of i) Sparse coding (Sc), ii) Max-pooling and iii) Spatial Pyramid Matching (SPM) techniques applied to histograms of multi-scale Local Binary/Ternary Patterns (LBP/LTP) and their improved variants. This approach can be considered as a two-layer hierarchical architecture: the first layer encodes the local spatial patch structure via histograms of LBP/LTP, while the second encodes the relationships between pre-analyzed LBP/LTP-scenes/objects. Our method outperforms SIFT-based approaches using Sc techniques and can be trained efficiently with a simple linear SVM.
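The two-layer pipeline described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation, and it rests on several simplifying assumptions: it uses basic single-scale 8-neighbour LBP rather than multi-scale LBP/LTP, a random unit-norm dictionary rather than a learned one, ISTA as one possible sparse-coding solver, and it omits the final linear SVM. All function names and parameter values (`lbp_codes`, `patch_histograms`, `ista_codes`, `spm_max_pool`, `patch=8`, `lam=0.005`) are hypothetical.

```python
import numpy as np

def lbp_codes(img):
    """Basic 8-neighbour LBP code for each interior pixel (assumed variant)."""
    h, w = img.shape
    center = img[1:-1, 1:-1]
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]
    codes = np.zeros(center.shape, dtype=np.uint8)
    for bit, (dy, dx) in enumerate(offsets):
        neigh = img[1 + dy:h - 1 + dy, 1 + dx:w - 1 + dx]
        codes |= (neigh >= center).astype(np.uint8) * np.uint8(1 << bit)
    return codes

def patch_histograms(codes, patch=8):
    """First layer: L1-normalised 256-bin LBP histogram per non-overlapping patch."""
    gy, gx = codes.shape[0] // patch, codes.shape[1] // patch
    hists = np.zeros((gy, gx, 256))
    for i in range(gy):
        for j in range(gx):
            block = codes[i * patch:(i + 1) * patch, j * patch:(j + 1) * patch]
            hists[i, j] = np.bincount(block.ravel(), minlength=256) / block.size
    return hists

def ista_codes(X, D, lam=0.005, n_iter=50):
    """Approximately minimise 0.5*||X - Z D||^2 + lam*||Z||_1 over Z with ISTA."""
    lipschitz = np.linalg.norm(D, 2) ** 2  # squared spectral norm of D
    Z = np.zeros((X.shape[0], D.shape[0]))
    for _ in range(n_iter):
        Z = Z - (Z @ D - X) @ D.T / lipschitz                            # gradient step
        Z = np.sign(Z) * np.maximum(np.abs(Z) - lam / lipschitz, 0.0)    # soft threshold
    return Z

def spm_max_pool(code_grid, levels=(1, 2, 4)):
    """Second layer: max-pool absolute codes over spatial pyramid cells."""
    gy, gx, k = code_grid.shape
    feats = []
    for n in levels:
        ys = np.linspace(0, gy, n + 1).astype(int)
        xs = np.linspace(0, gx, n + 1).astype(int)
        for i in range(n):
            for j in range(n):
                cell = code_grid[ys[i]:ys[i + 1], xs[j]:xs[j + 1]].reshape(-1, k)
                feats.append(np.abs(cell).max(axis=0))
    return np.concatenate(feats)

rng = np.random.default_rng(0)
img = rng.integers(0, 256, size=(64, 64)).astype(np.int32)  # stand-in image
hists = patch_histograms(lbp_codes(img))                     # shape (7, 7, 256)
D = rng.standard_normal((32, 256))                           # random dictionary (assumption)
D /= np.linalg.norm(D, axis=1, keepdims=True)
Z = ista_codes(hists.reshape(-1, 256), D).reshape(hists.shape[0], hists.shape[1], -1)
feature = spm_max_pool(Z)
print(feature.shape)  # (672,) = (1 + 4 + 16) pyramid cells x 32 atoms
```

The resulting fixed-length vector is the kind of image-level feature that could then be passed to a linear classifier.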
Similar resources
Temporal Bag-of-Words - A Generative Model for Visual Place Recognition using Temporal Integration
This paper presents an original approach for visual place recognition and categorization. The simple idea behind our model is that, for a mobile robot, the use of previous frames, and not only the current one, can ease recognition. We present an algorithm for integrating the answers from different images. In this perspective, scenes are encoded thanks to a global signature (the context of a scene) and ...
Learning Tree-structured Descriptor Quantizers for Image Categorization
Current state-of-the-art image categorization systems rely on bag-of-words representations that model image content as a histogram of quantization indices that code local image appearance. In this context, randomized tree-structured quantizers have been shown both to be computationally efficient and to yield discriminative visual words for a given categorization task. This paper presents a new ...
An Image Based Approach for Content Analysis in Document Collections
We consider the task of content based analysis and categorization in large-scale historical book scanning projects. Mixed content, deprecated language, noise and unexpected distortions suggest an image based approach. The use of keypoint extractors combined with the bag of features approach is applied to scanned text documents. In order to incorporate spatial information into the bag of feature...
Learning a Compositional Representation for Facade Object Categorization
Our objective is the categorization of the most dominant objects in facade images, like windows, entrances and balconies. In order to interpret images of complex scenes, we need an interaction between low-level bottom-up feature detection and high-level top-down inference. A top-down approach would use results of a bottom-up detection step as evidence for some high-level infer...
Content Based Medical Image Retrieval System (CBMIRS) Using Patch Based Representation
This research work aims to develop an efficient and powerful medical search engine to classify and search radiographic medical images. It focuses on a bag-of-visual-words image representation and a similarity matching technique to represent, match, and retrieve similar images. This work addresses the issues in content based image retrieval for medical images. In this system can handles differ...